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Abstract 

In this paper we study the problem of answering connectivity queries about a graph 
timeline. A graph timeline is a sequence of undirected graphs Gi,... ,Gt on a common set 
of vertices of size n such that each graph is obtained from the previous one by an addition 

or a deletion of a single edge. We present data structures, which preprocess the timeline 

and can answer the following queries: 

• f orall(u, V, a, b) - does the path u ^ v exist in each of Ga, • ■ •, Gb? 

• exists(u, V, a, b) - does the path u ^ v exist in any of Ga, ■ • ■, Gf,? 

• f orall2(it, u, a, b) - do there exist two edge-disjoint paths connecting u and v in each 
of Ga,...,Gb? 

We show data structures that can answer forall and forall2 queries in O(logn) time 
after preprocessing in 0{m + tlogn) time. Here by m we denote the number of edges 
that remain unchanged in each graph of the timeline. For the case of exists queries, we 
show how to extend an existing data structure to obtain a preprocessing/query trade-off of 
(0(m -|- min(nt, and show a matching conditional lower bound. 


1 Introduction 

In this paper we revisit the problem of maintaining the connectivity information in a graph 
timeline. The problem was formulated and solved in a recent paper by L^cki and Sankowski 
[S]. They define a graph timeline to be a sequence of graphs Gi, G 2 ,..., G* on a common set 
of vertices V of size n such that the graph Gj is obtained from Gi_i by adding or deleting a 
single edge. Their goal was to preprocess the graph timeline to build a data structure that may 
answer connectivity queries regarding a contiguous fragment of the timeline: 

• f orall(tt, u, a, b) — are vertices u and v connected by a path in each of Ga, Ga+i, • • •, Gj,? 

• exists(tt, V, a, b) — are vertices u and v connected by a path in any of Ga, Ga+i, ■ ■ ■, G^l 

We stress that the entire timeline is revealed in the very beginning for preprocessing, and after 
that the queries may arrive in an online fashion. 

Throughout this paper, we write {f {n,m,t), g{n,m,t)) to denote a data structure, whose 
preprocessing time is f{n,m,t) and the query time is g{n,m,t). 
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In the case of f orall queries, Lqcki and Sankowski presented an {0{m+t log t log log t log n), 
O(lognloglogt)) data structure. Here by m we denote the number of edges that remain un¬ 
changed in each of Gi,..., G*. Their data structure is Monte Carlo randomized and the query 
time is amortized. For exists queries they give an {0{m + nt),0{l)) data structure. 

We improve the results of [9] and show new algorithms, which are more efficient, simpler 
and deterministic. In addition, we also develop an extended data structure that may efficiently 
answer an even more complex query regarding 2-edge-connectivity: 

• f orall2(ri, u), a, 6) — are vertices u and v connected by two edge-disjoint paths in each 
of Ga, Gq+i, ..., Gb? 

Moreover, we give new conditional lower bounds for the problem of answering exists queries, 
which also improves the results of [9]. 

1.1 Related work 

A rich body of connectivity-related dynamic problems has been studied in the area of networks 
and distributed computing. A number of such problems has been surveyed in [2]. In a typical 
scenario, we work with a sequence of graphs G* = Gi,..., G* that represent the states of an 
evolving network at different points in time. However, the properties of these graphs, which 
are of interest, such as T-interval connectivity [8] or time-respecting paths [7] are usually much 
more complex than what can be studied with ordinary connectivity queries, that is queries about 
the existence of a path connecting two given vertices in a particular graph. For example, the 
problem of T-interval connectivity consists of deciding if for every subsequence Gq, ..., Ga+r-i 
of T consecutive graphs in G*, the intersection Gq n ... n Ga+T-i of these graphs contains 
a connected component spanning all vertices. Here we define the intersection of two graphs to 
be the graph obtained by intersecting their edge sets. 

We believe that the queries we consider in this paper are powerful enough to study interesting 
properties of evolving networks. A f orall query checks if two vertices are connected with a path 
in every graph among Ga, ..., Gb, but the path can be different in each of the graphs and may 
not even exist in the intersection of these graphs. Even stronger is a forall2 query, checking 
whether two vertices are connected with two edge-disjoint paths in each graph of the given 
fragment. This may serve as a measure of robustness of connection between two nodes of 
a network. 

The algorithms that process graph timelines can also be considered semi-offline counterparts 
of dynamic graph algorithms. The updates are given upfront, but the queries may arrive in an 
online fashion, i.e. they are issued one by one, only after the preprocessing is finished. A possible 
scenario for the semi-offline model would be to collect and index the history of evolving network 
up to some point of time and then use the queries to analyze various properties of the network 
efficiently. 

It is worth noting that the knowledge of the entire history of changes in most cases leads to 
data structures faster and simpler than the best online ones. However, this property has rarely 
been exploited to design efficient algorithms. Eppstein [2] has shown an algorithm, which, given 
a weighted graph G and a sequence of k edge weight updates, computes the weight of the 
minimum spanning tree after each update in 0{{m -|- k) logn) time. 

1.2 Our results 

We show {0{m -\- t\ogn),0{logn)) data structures for answering f orall queries and forall2 
queries. The data structures use 0{t log n) space. This improves the results of [9] in a number of 
ways: our algorithms are faster and deterministic, use less space, the time bounds are worst-case 
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and the query time is independent of the length of the timeline. We also introduce forall2 
queries, which were not considered before. On top of that, our algorithms are arguably simpler. 

What is interesting, we obtain a solution for the 2-edge-connectivity problem, which is much 
more efficient than what has been achieved in the dynamic case. The best known algorithm for 
2-edge-connectivity is due to Holm et al. [5]. It processes t updates in 0{{t + m)\og^n) time, 
where m is the initial number of edges, and answers queries in O(logn) time. Our algorithm 
may preprocess the timeline in only 0{m + tlogn) time to answer queries in O(logn) time. 

In the construction of the algorithm for answering forall queries we use the following two 
observations. Consider a timeline Gi,... ,Gt- If there is an edge uw present in every graph 
among Gi,... ,Gt, vertices u and w are equivalent from the point of view of any query, so the 
edge uw can be contracted in each graph. Once we do that, we are left with 0{t) edges in 
total, each being added or deleted at some point of time. Thus, if there are much more than t 
vertices, some vertices are isolated in every Gi,... ,Gt, and can be safely treated separately 
in the beginning and removed. These ideas are then used recursively in a divide-and-conquer 
algorithm, which at each step halves the length of the timeline to compute a segment tree 
over the sequence Gi,... ,Gt. This segment tree stores connectivity information about every 
individual graph in the timeline. Here we adapt the ideas of Eppstein’s reduction and contraction 
scheme used for offline computation of minimum spanning trees [3]. 

Next, we use a fingerprinting scheme to identify vertices belonging to the same connected 
components in multiple consecutive graphs, which allows us to answer forall queries. Ad¬ 
ditionally, our fast algorithm for answering queries uses a data structure for efficient testing 
of equality of contiguous subsequences of a given sequence. This is then extended to handle 
forall2 queries. 

For exists queries, we show how to leverage the {0{m + nt), 0(1)) data structure from [9] 
to build an + data structure, where a is a parameter from the range 

[0,1), which can be chosen arbitrarily. All of the presented algorithms are simple and can easily 
be implemented. 

Moreover, we develop a conditional lower bound for the problem of answering exists queries. 
We show that answering t exists queries on a timeline of length t, consisting of graphs with 
0{t) edges, can be used to detect triangles in a graph with 0{t) edges. This implies a conditional 
lower bound of IIand improves the result of [9], where a weaker lower bound was shown. 
We also show that an combinatorial algorithm for the aforementioned problem would 

imply a subcubic combinatorial algorithm for the Boolean matrix multiplication problem, which 
would be a major breakthrough. At the same time, our improved data structure for exists 
queries may solve this problem in time, which means that it is, in some sense, optimal. 

1.3 Organization of this paper 

In Section [2] we introduce notation and give a few simple properties of segment trees, which we 
later use. Section [3] describes the basic version of our data structure, which is then extended to 
handle forall and forall2 queries. Then, in Section|3]we present an algorithm for answering 
forall queries. Next, in Section [5] we develop improved lower bounds for the problem of 
answering exists queries, as well as show that a trade-off between query and preprocessing 
time is possible. Finally, in Section [6] we discuss the possible directions of future research. 

2 Preliminaries 

A graph timeline is a sequence G* of graphs Gi,G 2 , ■ ■ ■ ,Gt, where Gi = {V, Ei). We call each 
individual graph in G* a version. For each i G [l,t) we have \Ei (B Ei^i\ = 1, i.e. Fj+i is 
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obtained from Ei by adding or deleting a single edge. We assume that the input is given as the 
set El and a list of t — 1 operations that describe, for each i G [l,t — 1], how to obtain EiJ^i 
from Ei. 

Throughout this paper we work with intervals of integers, that is [a, h\ denotes 
{a, a + 1,, b}. We say that edge (u, v) is alive in the interval [x, y\ iff (n, v) G Ej for each 
j G [x,y]. For each edge e G Fii U ... U Fij we define L{e) to be the set of maximal intervals 
such that e is alive in each of them. An edge e is called permanent iff L{e) = {[l,t]}, that is, 
it is present in every version. Otherwise, we say that e is a temporary edge. We denote by m 
the number of permanent edges. The number of temporary edges is at most t. We begin the 
initialization of our data structures by finding the sets L[e) in 0{\Ei\ +1) = 0{m + t) time. 

We denote by A+ the set of edges e such that [a,x] G L{e) for some x G [a,t], i.e., edges 
present in Ga, but not in Ga-i- Similarly, let be the set of edges e such that [x,b] G L(e) 
for some x G [1, 6]. It is easy to verify that Yl\=i + Yl\=i ~ 0{m + t). Moreover, for 
a G we have |A+| < 1, while for b G [l,t) we have |A^| < 1. 

Throughout the paper, we assume that t > n and t = 2^ for some integer B > 0. The latter 
assumption can be achieved by adding dummy graphs to the timeline. 

2.1 Elementary intervals and the segment tree 

Given t = 2^, the set of elementary intervals is defined inductively: 

1. [l,t] is an elementary interval, 

2. if [a,b] is an elementary interval, and a < b we let mid = |_^^J, and define [a, mid] and 
[mid + 1,6] to be elementary intervals as well. 

The set of elementary intervals can be naturally organized into a complete binary tree, which 
we call a segment tree. Assuming the above notation, we call left([o, 6]) = [a, mid] the left child 
of interval [a, 6]. Similarly, right([a,6]) = [mid + 1,6]. The parent interval of P is denoted by 
par(P). We first prove a few properties of elementary intervals. 

Lemma 1. Every two elementary intervals are either disjoint, or one of them is contained in 
the other. The latter is the case iff one of them is a descendant of the other in the segment tree. 

Lemma 2. Every interval [c, d] C [1, t] can be partitioned into no more than 2 log 2 ((i — c+1) + 2 
disjoint elementary intervals such that no two intervals from the partition can be merged into a 
bigger elementary interval. The partition can be computed in time 0{\og{d — c + 1)). 

Proof. If c = d, then the interval does not have to be partitioned at all. Assume c < d. Consider 
the leaves [c, c] and [d, d] of the segment tree and let P be the lowest common ancestor of these 
intervals, i.e., the smallest elementary interval which contains both c and d. Our initial partition 
is formed by the following intervals: 

• [c, c] and [d, d ], 

• if both the interval Q and its parent lie on the path from [c, c] to P (but excluding P) and 
also Q = left(par(Q)), we include right(par(Q)) (i.e. the sibling of Q) in our partition, 

• if both the interval Q and its parent lie on the path from [d, d] to P (but excluding P) and 
also Q = right(par(Q)), we include left(par(Q)) (i.e. the sibling of Q) in our partition. 

We first show that the chosen family of intervals W is indeed a partition of [c, d]. By Lemma [H 
the chosen intervals are disjoint, since there are no two such that one of them is an ancestor 
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of the other. For any interval from W, its left endpoint is not less than c, whereas its right 
endpoint is not larger than d. Hence, [_}W C [c,d]. Moreover, [c,d] C (JhF. It is clear that 
{c, d} C y VF. To show that / G (c, d) belongs to U IF consider a path from [/, /] to P. This 
path either joins the path [c, c] —)• P from the right, or joins the path [d, d] —)• P from the left. 
In both of these cases, the last interval Q of [/, /] —)• P before the paths merged {f G Q [c, d]) 
was included in W. 

Let us count the number of intervals in W. First notice, that every elementary interval 
in W is not longer than d — c + 1. Furthermore, each subsequent interval chosen from one of 
the paths ([c, c] —)• P or [d, d] ^ P) is at least twice as long as the previous interval taken while 
climbing that path. Taking into account the additional intervals [c, c] and [d, d], we get the 
bound 21 og 2 (d — c + 1) + 2. The 0(log(d — c + 1)) time can be achieved by climbing the two 
paths simultaneously. 

The above procedure does not guarantee that no two elementary intervals from W can be 
merged into a larger elementary interval. However, this can be easily fixed. Every time when we 
put into W an elementary interval such that its sibling in the segment tree is already contained 
in W, we replace the two siblings with their parent. As the lengths of the elementary intervals 
put into W only increase on a path [c, c] —)• P or [d, d] —)• P, the potential sibling can only be 
the interval that was the last to be included in W. 

Eventually, we might also end up with W = {left(P),right(P)}; then we ought to replace 
the partition with {P}. 

This hx does not influence the overall time complexity of the partitioning, which remains 
0 (log(d — c + 1)). □ 

Lemma 3. //Pi, P 2 ,..., Pfc are disjoint intervals contained in [1, t], we can partition them into 
at most 2k (log 2 ^ + l) disjoint elementary intervals. 

Proof. We use Lemma [2] to partition each of Pi,...,Pfc. For 1 < i < A:, let /* = |Pj|. Since 
the intervals are disjoint, their partitions into elementary intervals are also disjoint. Hence, by 
Lemma [21 the total size of the partition can be bounded as follows: 


k 

2k + 2 log 2 li < 

i=\ 



<2k + 2k log 2 — = 2 / ( log 2 7 + 1 


k 


k 


We used the bound Yli=i h ^ t and the Jensen’s inequality for the concave function 
/(x) = log2X. □ 

As it is much easier to work with elementary intervals, for each edge e we partition all 
intervals from L{e) into elementary intervals. 

Lemma 4. All intervals in UeGyx 7 '^(®) partitioned into 0{m + tlogn) elementary 

intervals. The partition can be performed in time 0{m + f logn). 

Proof. Denote by E* the set of temporary edges. For any e G E*, let us denote by (/e the 
number |P(e)|. We have 'i'e ^ If)A] and |P*| < min(f,n^). By Lemma [3l we conclude 
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that the total number of elementary intervals for temporary edges is at most 



+ 2t 


< 2 tlog 2 {2\E*\) + 2t 
=0{tlogn). 


Here we used the Jensen’s inequality for the concave function f{x) = a:log 2 ^ and weights 
equal to Since each permanent edge has exactly one interval in its partition, we obtain 

the desired bound 0(m + tlogn). 
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For an elementary interval [a,b], we set -F[a,fe] to be the set of edges that contain [a,b] in 
their partition. From Lemmas [5] and 0] it follows that each edge is contained in O(logt) sets 
E[a,b] and the sum over elementary intervals E[a,b] is of order 0{m + tlogn). 

3 The data structure 

We now describe a tree-like data structure T, which is a crucial part of all our algorithms. In 
the following we reserve the name T for this particular data structure. The data structure T 
is based on the set of all elementary intervals organized into a complete binary tree. This 
tree has a single node T^a,b] for each elementary interval [a, 6 ]. Denote by G[a,fe] the graph 
(y, Ea r\ ... r\ Eb). Roughly speaking, our goal is to associate with T\^a,b] the information about 
the connected components of G[a, 6 ] • We first give a simple approach for constructing the data 
structure T, and then show how to speed it up. We use the following fact. 

Lemma 5. Let [a, 6] he an elementary interval such that Then 

'^(G*par([a,b])) D • 

Proof. Recall that E{G[a,b]) = E{Ga) n ... n E{Gb). Thus, Fl(Gpar([a,fe])) ^ E{G[a,b])- More¬ 
over, directly from the definitions we have E^^^b] ^ E{G[afi])- R remains to show E{G[a,b]) ^ 

-®(^par([a,b])) D 

Consider an edge e E E{G[afi]- If e is alive in some interval [c,d] D such that 

par([a, 6 ]) C [c,d], we have e E F^(Gpar([a,fe]))- To complete the proof it remains to consider 
the case when e is alive in an interval [c,d], and par([a, 6 ]) % [c,d]. We show that in this 
case e E In other words, [a, 6 ] E X, where X is the partition of elements of L{e) into 

elementary intervals. 

From par([a, 6 ]) ^ [c, d] it follows that no ancestor of [a, 6 ] is a part of X. At the same 
time, every elementary interval that is neither an ancestor nor a descendant of [a, b] is disjoint 
with [a,b] (Lemma [T]), so it does not belong to X either. Therefore, each elementary interval 
that belongs to X and intersects [a, b] is either [a, b] or one of its descendants. Consequently, X 
contains a subset of disjoint elementary intervals Y such that (J T = [a,b]. Suppose Y ^ {[a, b]}. 
Let Qi be the shortest interval from Y. Note that Qi 7 ^ [a, b]. Also, by Lemma[2l the sibling Q 2 
of Qi is not contained in Y. As Qi is the shortest, no descendant of Q 2 is contained in Y. 
Moreover, as Qi E Y, no ancestor of Q 2 is contained in Y. Hence, Y does not cover the integers 
from Q 2 , a contradiction. Thus, we have Y = {[a, 6 ]}, so [a, 6 ] E X and, as a result, we have 
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In the simple approach, we associate with T\^a,b\ ^ graph S[a,b]^ which has a single vertex for 
each connected component of G[a,fe]) and does not contain any edges. By Lemma O G[a,b] is 
obtained from G'paj.([a,b]) by adding some edges. This implies that each component of C?[a,b] is a 
sum of some components of Gpar([a,fe])- To compute S[a,b] we build a graph H on a vertex set 
F(5par([a,fe])) and add to it edges of L^[a,fe] (each edge endpoint has to be mapped to its connected 
component in Gpar([a,6])) and then find its connected components. These components are exactly 
the components of Cr[a,fe] ■ Observe that during this computation we may also compute a mapping 
between the vertices of 5'par([a,fe]) and S[a,b]- la the case of S'ji we compute a mapping between 
individual vertices and connected components of G[i,t]. 

T represents the connected components of every graph in the timeline. Consider a graph 
Gc- In order to find a connected component of a vertex v in Gc, we traverse the path in T from 
to T^c,c]- We compute the connected component of vertex v in every graph G^afi] on the 
path. Observe that if we know the connected component of v in Gpar([a,fe])! we may compute 
the connected component of v in by following the mapping between the components of 

Gpar([a, 6 ]) and G[a,fe]- At the end of the traversal, we find the component of v in G[c^c] = Gc- 

3.1 An efficient construction 

In order to compute the data structure T efficiently, we need to make an additional optimization, 
which is crucial for obtaining good running time. 

Consider an elementary interval [a, b] and a connected component G of . Assume that 
within the graphs Ga,---,Gb no edge incident to a vertex of G is ever added or deleted. In 
other words, the edges incident to vertices of G are the same in each of Ga, • • •, G;,. This means 
that in each of Ga, ■ ■ ■ ,Gb vertices of G are connected to each other, but not connected to any 
vertex outside G. Hence, G is also a connected component in each of Ga, • • •, G;,. 

As a result, there is no need to store G in the descendants of T^a,b]- When searching for a 
connected component of a vertex v £ G in Gc, where c G [a,b], we may simply stop the search 
in the representation of G in T^a,b] ■ This observation will be used in the reduction phase of the 
construction of the tree T. 

We now describe the efficient construction of the tree T. For each node T^a,b] of T, where 
[a, b] is an elementary interval, we compute a graph Sjq b]. The vertices of <S'[a^b] correspond to 
some of the components of G[a,b] ■ We say that u G H is represented in if there is a vertex 
s G F(5[a b]) that corresponds to a component containing v. The graphs S[a,b] have no edges0 

Let [a,b] be an elementary interval. is computed based on 5'par([a,b]) (or (H, 0), if 

[a,b] = in two phases called reduction and contraction. 

In the reduction phase some vertices oiH = 5'par([a,fe]) removed, as they are not affected by 
any edge addition or deletion that is carried out among Ga, • • •, Gfe. Namely, we mark endpoints 
of edges in F = E^cL,b] U Uta+l U Ui=a Aj and then remove the unmarked vertices. Note 
that the sets Eya^b] i A~ contain edges of the original graph, so their endpoints have to 

be mapped to the corresponding vertices of H. The reduction phase is performed only when 
6 — a + 1 < n. It is done by a call Reduce(H, F), which produces a pair [S',M), where S' is 
the reduced graph and M is a mapping between V (5'par([a,b])) and V(S') U {T}. The value of T 
means that a vertex has been removed and does not have a corresponding vertex in S'. The 
procedure can be implemented with a simple graph search to work in 0(|F| + |F|) time. 

In the second phase, called the contraction phase, some of the remaining vertices of H = S' 
are merged to form Specifically, the components formed in S' after adding edges F = Ey^^b] 

are contracted. Again, we use a function Contract(F, F), which produces a pair {S',M) 

^Defining a graph with no edges may look confusing. However, we define 5'[a,6] to be a graph, as we add edges 
to in our data structure for 2-edge-connectivity. 
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consisting of the contracted graph S' and the mapping between H and S'. This function can 
also be easily implemented to work in linear time. 

Consider an elementary interval P. Together with Sp, the node Tp stores two tables Ip and 
rp mapping vertices of Sp to C(5ieft(p)) U {T} and P(S'i.ight{p)) U {T} respectively. If lp[A:] /T, 
lp[A:] is the vertex of 5'ieft(p) that corresponds to k G V{Sp). lp[k] =T means that P is a leaf, 
or there is no vertex corresponding to k in 5'ieft(p). The table rp is defined analogously. For 
simplicity, we also assume that is a left child of a special node ^[ 0 , 00 ] and S'jq^oo] = 
so that for each v gV, 1 [o,cxd][^] points to the vertex of S'ji representing the original vertex v. 

The graphs Sp along with 1 and r pointers are sufficient to find the component of any vertex 
V in any of Gi,..., G*. To access the component of vertex v in Gc we start at vertex v in 5[o,oo] 
and follow 1 or r pointers in order to reach the leaf T[c,c]- The traversal stops once we reach T^c,c] 
or the pointer we want to use (l[A:] or r[A:]) is equal to T. Let P be the elementary interval, 
where the traversal finishes and k be the vertex in Sp, which we reached. Then, as we later 
show, (/c, P) uniquely identifies the component of vertex v in Gc- The above process can be seen 
as a function COMP-lD(t(;, a, 6 , c) that follows the path to T\^c,c] starting at vertex w G F(S[a^ 6 ]). 
The pair {k,P), defined as above, is what the call Comp-Id(1[o^oo] H) Ip; c) returns. The full 
text of the Comp-Id function is given in Appendix O 

Lemma 6 . Let 1 < c < t. For any u G V, denote by {ku,Pu) the value returned by 
Comp-Id(1[o^cxd] H) 1) t, c). Then, two vertices v,w G V are connected by a path in Gc iffkv = k^ 
and Py = Pw 

Proof. Observe that [c, c] C P^ and [c, c] ^ Pw First, assume that Py ^ P^, which, by LemmaH] 
means that one of the intervals contains the other one. Without loss of generality suppose that 
Pv ^ left(P^). Then k^ is a component of Gp„ that is not incident to any changes in the 
time interval left(P^), while v is in some component of Gp^ that undergoes changes in left(P^). 
Thus, these are different components. If Py = Pw, then both ky and ky, are components of Gp„ 
not incident to any changes in the time interval P„. Both v and w, however, are represented in 
Sp^, so they are in the same component iS ky = ky,. □ 

Let us bound the time needed to build T. We begin with an auxiliary lemma, whose proof 
is based on the fact that we perform the reduction. 

Lemma 7. Let [a,b] be an elementary interval. Then < min( 8(6 — a + l),n). 

Proof. Clearly, < n. For 6 —a + 1 > n, we have 8(5 —a+ 1) > n, so |F(5'[a^b])| < n < 

min(n, 8(5 — a + 1)) holds. Assume that 5 — o + 1 < n. The graph 5'jais constructed from 
5'par([a,fe]) by applying reduction and contraction. Let G = A~. The reduction 

produces a graph S' of at most ^ U G| vertices. The contraction does not increase the 
number of vertices. Therefore, |F(<S'[a^b])| < 2|P[a fe] U G|. 

Let par([a, 5]) = [ai,5i] and consider the analogous set Gi for par([a,5]), i.e. 

Gi = Uiiai+i bl Uiiai Tor 1 < i t we have |A^| < 1 and for 1 < i < t we 

have |A“| < 1. Moreover, since par([a, 5]) = [oi, 5i], we have 5i — oi + 1 = 2(5 — a + 1). Thus, 
|Gi| < 2(5i — ai + 1) = 4(5 — a + 1). To complete the proof, we show that both G and P[a,fe] 
are subsets of Gi, which implies that |F(5'[a^6])| < 2\E^a,b] U G| < 2|Gi| < 8(5 — o + 1). Clearly, 
G C Gi, as the sum in G goes through less summands than the sum defining Gi. To show that 
E[a,b] b Cl, consider e G E[a,b]- Suppose that [a,5] is the left child of par([a,5]) = [ai,5i]. We 
show that e G A”. By Lemma El we have both e ^ P[ai,bi] and e ^ Pright([ai,fei])- Thus, 

the edge e is deleted in some version Gj for j G [5+ l,5i], which means e G ^J-i- Analogously 
we prove that if [a, 5] = right([ai, 5i]) then e G Uiiai+i ■ Heace e G Gi. □ 


To build T we use a recursive procedure COMPUTE-TREE(a, 6 ), which computes the subtree 
rooted at T^a,h]- It produces each graph S'ja;,] based on <S'par([a,fe]) by applying reduction and 
contraction. During the computation of T, we maintain an auxiliary table repr, fulfilling the 
following invariant; both at the beginning and at the end of the call COMPUTE-TREE(a, 6 ), 
repr[u] is the vertex of 5'par([a,b]) representing u G D, if such vertex exists. Initially, we have 
repr[u] = v, which does not break the invariant, as we have previously set par([l, t]) = [0, oo] and 
5[o^oo] = The repr table is used implicitly by the procedures Reduce and Contract 

to map the endpoints of edges from T^[a,fe]) A“ to vertices of 5'par([a,b]) in constant time. 

All the computed tables use linear space and can be accessed in constant time, as we can 
identify the vertices of introduced graphs Sp with natural numbers {1,2 ,...} and the T value 
with 0. The total used space is asymptotically no more than the time spent on computing T, 
that is 0{m + tlogn). 
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procedure COMPUTE-TREE(a, b) t> [a, b] - elementary interval 

P := par([a, 6]) 

if b — a + 1 < n then > Reduction is only done for short elementary intervals. 

c := Ut„+1 A+ u Ut' A- 

U :=vertices of V incident with any edge of U C 

{S',M') := red\jce{S p, Eia,b] U C) 


else 

U :=V 

(5',M0 = (5p,id) 
for tt G [7 do 

nieni[«] = repr[tt] 
for tt G 1/ do 

repr[ri] := M'(repr[tt]) 

{S",M") := Contract {S', Ei^a,b]) 

Q _C// 

^[a,b] ■— ^ 

for k G <S'[a^b] do 
-I[a,6][^] • T 

r[a,6][fci :=T 

for u £ U do 

repr[ri] := M"(repr[7i]) 

for s G Sp do 

s' := M'(s) 
if s' then 
s' := M"(s') 
if [a,b] = left(P) then 
lp[s] := s' 
else 

rp[s] := s' 

if a <b then 

mid := 

COMPUTE-TREE(a, mid) 
COMPUTE-TREE(mid + 1, 6) 

for u £ U do 

repr[u] := nieni[u] 


0 remember old repr values 


0 M" maps V{Sp) to R(5'[a^fe]). 
i> initialize 1 and r pointers to T 


0 set the parent 1 and r pointers 


i> compute the children 


i> restore repr to the initial state 
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Lemma 8 . The total running time 0 / Compute-Tree(1, t) is 0{m + t\ogn). 

Proof. We first analyze the time spent in the call COMPUTE-TREE(a, 6 ), excluding the work 
in recursive calls. Let C be Ui=a+i Uta ^7■ Thus 0{\C\) = 0{b — a). Recall that the 
functions CONTRACT and Reduce run in linear time. For 6 — a + 1 > n, we only perform 
contraction of L^[a,fe] in a graph of size 0(n), which requires 0(n + |iii[a,fe]|) time. The amount 
of work for 6 — a + 1 < n can be bounded by 0(|R(*S'par([a,b]))l + IC^I + |T'[a^b]|), as Reduce is 
passed the edges C U L'[a, 6 ] • 

To complete the proof, we sum these running times over all elementary intervals. The term 
\Ea^h\ appears in both cases and, by LemmalU we have YlpEp = 0{m + tlogn), thus we can 
focus on the other summands. For the case h — a + 1 > n, the remaining work is 0(n), but there 
are only O(^) such intervals, so the total work is 0(t). On the other hand, if 6 — a + 1 < n, by 
Lemma[71 0(|R(5'par([a,fe]))l) = 0{b — a), so the total work is 0{b — a). Hence, the total work on 
each level of the tree such that its elementary intervals are shorter than n, is 0{t). The number 
of such levels is O(logn), which gives O(tlogn) total time. The lemma follows. □ 

Having computed T, the function COMP-lD allows us to access the component of some 
vertex v in Gc in time O(logt). However, as we now show, this can be speeded up to O(logn) 
time. Recall that t = 2^. Let 2^ be the smallest power of 2 such that 2^ > n and fix some 
k G [0,2^“^). Then, for each c G [A; • 2^ + 1, (A; + 1) • 2^], the call Comp-Id(1[o,oo] H) IjLc) 
descends down T through the first B — D levels in the same way, independent of c. We can 
thus add another preprocessing phase, building the table shortcut. For a vertex v and 0 < k < 
2^~^, short cut [n] [A:] is defined to be a pair {s,P) such that for c G [A: • 2^ + 1, (A: + 1) • 2^], 
Comp-Id(1[q 00 ] [n], 1, t, c), after going through at most B — D levels of T, ends up in the interval 
P and s G V{Sp) represents v. There are only 0{t/n) allowed values of k,so the table shortcut 
has size 0{t). 

The table can be computed by finding the components of each vertex v in all the graphs 
Sp from the first B — D levels of the tree. As the component of n in Sp can be computed in 
constant time based on the component of v in 5'par{p), we spend 0{t/n) time for each n, and 
thus 0{t) time in total. 

The optimized procedure Comp-Id starts by looking up the shortcut through first B — D 
levels of T and then calls the original Comp-Id, starting at an elementary interval of length 
0(n). Thus, its running time is O(logn). 

3.2 2-edge-connectivity 

As in the case of connectivity, we first show how to preprocess the graph in order to efficiently 
answer 2-edge-connectivity queries regarding individual versions. Our approach is similar to 
the idea of Section ED we construct a data structure T containing graphs where [a, 6 ] is 

an elementary interval. Note that in the case of connectivity, the graphs iSja b] do not contain 
any edges. 

First, observe that contracting 2-edge-connected components yields a forest. 

Lemma 9. Let W be the set of 2-edge-connected components of some graph G. Define the graph 
H = (VF, F), where 


F = {{wi,W 2 ) : {u,v) G E{G),u G wi,v G W 2 ,wi 7 ^ W 2 }. 

Then, H is a forest. 

Proof. Indeed, if there was a cycle W 1 W 2 ... WkWi in H, then the components wi,... ,Wk would 
form a single 2-edge-connected component. □ 
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In the case of 2-edge-connectivity, the graphs 5'[a,fe] are forests of rooted trees, whose vertices 
represent some of the 2-edge-connected components of G[a,b]- Each rooted tree in the forest 
represents a part of some (ordinary) connected component incident to some edges alive in the 
time interval [a, b]. 

The vertices of are partitioned into two categories. A vertex s G V{S[a,b]) is a simple 
vertex if and only if it represents a single 2-edge-connected component of G^a,b] ■ Otherwise, s 
is called a path vertex and it represents k {k > 2) 2 -edge-connected components of G[a,b] — 
Cl, ..., Cfc — that form a “path”, i.e. for each i G [1, k) there is a single edge in Gp connecting 
some vertex of c* and some vertex of Cj+i. We maintain the following invariants. 

1 . If s is a root of its tree, or its degree in Sp is other than 2, then it is a simple vertex. 

2. A path vertex is never adjacent to another path vertex. 

In particular, each path vertex is of degree 2 in Let s be a path vertex representing a 

path Cl, C 2 ,..., Cfc of 2-edge-connected components of G[a,b]- E si is a parent of s and S 2 is a 
child of s, then the edge (si,s) G E{S[ci,b]) is actually an edge between components si and ci, 
while the edge (s,S 2 ) £ E{S[a,b]) actually means (cfc,S 2 )- 

The components ci, C 2 ,..., c^ of a path vertex s G E(•S'ja {,]) have the following property: for 
each X G [a, b], either ci, C 2 ,..., c^ are actual 2-edge-connected components of Gx or they are 
all parts of a single larger 2-edge-connected component of Gx- Thus, the path vertex s allows 
us to trace components Cj in the descendants of T^a,b] iii ^ uniform way. This in turn will later 
allow us to keep the graphs small. 

Recall that in the case of connectivity, the call Comp-Id(1[o_oo] ['^]) 1) L x) returns a pair (s, Q) 
{s G R(5q)), where Q is the last interval on the path [l,t] ^ [x,x\ in the segment tree such 
that V is represented with s in V{Sq). The introduction of path vertices forces us to modify 
what Comp-Id does, as we need to distinguish between distinct 2-edge-connected components 
that are represented by the same path vertex. Now for x G [a, 6 ] and k G we set 

Comp-Id(A:, a, 6, x) to be the pair (s, Q), where s is a representation of k in Sq and Q is the last 
interval on the path [a, b] —)• [x, x] such that s is a simple vertex. To access the 2-edge-connected 
component of u G E in version Gx, we use the same call Comp-Id(1[o,oo] N) ^,t,x). 

The tables and are defined analogously; for s G E(5[(j;,]), if l[a^fe][s] =T, then s 
is not represented in 5'ieft([a,fe])- Otherwise, is a vertex of 5'ieft([a,b]) representing s. The 

table ^[a,b] is defined analogously. 

Having defined graphs we now describe how they can be computed. As previously, we 

compute •S'ja fe] based on <S'par([a,b])) by performing first the reduction and then the contraction. 
The reduction is again performed only if 6 — a -|- 1 < n. 

The reduction proceeds in phases. The initial phases involve marking some nodes of 5'par([a,fe])> 
whereas the latter phases reduce the graph’s size. The path vertices never get marked; they can 
be instead merged with other path and simple vertices, forming “longer” path vertices of *S'[a^b]. 

Let C be again Uta+l the first phase we mark all the vertices of *S'par([a,fe]) 

incident to edges in U G. It might be the case that for some original edge {u,v), u and v 
are already represented by the same s in <S'par([a,b]) — then we just skip this edge. As a result, 
no more than 2\E\^a,b] U G\ vertices of 5'par([a,fe]) are marked. 

In the second phase we mark all the lowest common ancestors of marked vertices, that is, 
the vertices s such that in the first phase, the vertices from at least two distinct subtrees rooted 
at children of s, were marked. The common ancestors can be marked in linear time, using 
post-order traversal — we only need to store for each vertex s, whether any element of the 
subtree rooted at s was marked in the first phase. Additionally, we mark the root of every tree 
with at least one marked vertex. 
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Let us count the vertices marked after the second phase. Remove the subtrees with no 
marked vertices. Let q be the number of marked vertices of degree 2. If we replace all the 
degree 2 vertices with edges, we obtain a forest, where every vertex that is neither the leaf nor 
the root, has at least 2 children. Denote by I the number of leaves in this forest. Clearly, it 
has at most 21 vertices. However, every leaf could be marked only in the first phase and hence 
I + q < 2\E^a,b] ^ C\, so 21 + q < 4:\E^a,b] U Cl. 

Corollary 1. After the second phase of the reduction, at most A\E\^a,b]'A C\ vertices of Spar{[a,b]) 
are marked. 

The third reduction phase removes the subtrees with no previously marked vertices. All 
the 2 -edge-components represented by vertices from those components look exactly the same 
in G[a,b] as well as in all the individual versions Ga, ■ ■ ■ ,Gb and thus need not be tracked in the 
descendants of Tfa.fe] • 

In the last phase we replace every remaining path of degree 2 unmarked vertices with a single 
path vertex. These vertices may include both simple and path vertices. However, neither of them 
has been marked, so for each x G [a,b], the underlying path of 2 -edge-connected components 
Cl,... ,Cg either remains unaltered in Gx or is a part of a single, larger 2 -edge-component in Gx. 

Since the number of such paths does not exceed the number of vertices marked so far, we 
end up with a forest S' of at most U C| vertices. 

Each phase of the reduction can be implemented as a simple graph search, so the reduction 
takes time 0(|E(5’par([a,6]))l + \E[a,b] \ + 1^1). 

After the reduction comes the contraction. We extend the forest S' with the edges E[a,b] 
alive in each of Ga,... ,Gb. We merge the 2-edge-connected components found in this graph 
into new, simple vertices, obtaining a new graph S", which is again a forest. It may happen 
that some vertices of S' have not been merged into larger components in S". Every such vertex 
s G V{S') is a path vertex in S" iff it is a path vertex in S'. The roots of trees of S" are 
chosen arbitrarily, but keeping in mind that the trees should not be rooted at path vertices. 
The properly rooted S" forms our graph Sp. Contraction can be implemented to work in time 

0 (l^(V(M))l + |i^Ml)- 

Let us bound the time needed to compute 5'[a,fe] • If 6 — a -|- 1 > n, then the reduction is 
skipped and thus the time spent on building is 0(n + Otherwise, the reduction 

is performed and we spend 0 (|E(<S'par([a, 6 ]))l + |C| + l-®[a, 6 ]l) time. 

The asymptotic running time of building 5'[a fe] turns out to be exactly the same as in the 
case of connectivity. Thus, building a data structure T for representing 2-edge-connectivity 
takes the same time. 

Corollary 2. We ean build a data structure T representing 2-edge-eonneetivity in a graph 
timeline in 0{m + tlogn) time. The space usage is Oftlogn). 

The optimization allowing the evaluation of Comp-Id in time O(logn) applies here as well. 


4 Answering forall queries 

In this section we show how to extend the data structure T, so that it can be used for answering 
forall queries. The preprocessing for forall queries constitutes another phase, that we apply 
only after we computed the data structure T. 

Let us begin with a simple observation. Assume that we want to answer a f orall(ii, w, a, b) 
query, where [a, 6 ] is an elementary interval. Then, if the same vertex of f,] represents both 
u and w, then there is actually a path between u and w in G[a,b] and we can immediately give 
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a positive answer. However, the reverse relation is not true. It may happen that u and w are 
represented by distinct vertices in S[a,b]^ but are connected in each of Ga, ■ ■ ■, Gb- Thus, our first 
goal in this section is to compute, for each two vertices in each of whether the vertices 

represented by them are connected in each of Gq, ..., Gf,. 

For an elementary interval [a, 5], let C[a^b](s,x), where s G x G [a,b], be the 

result of the call Comp-Id(s, a, 6, x). Our goal is to compute for each vertex s G a 

fingerprint, that is, an integer H]^a,b]{s) G [1, |H(5[a,;,])|] with the following property: the se¬ 
quences C[a,6] (s, a)c[a,b] (s, o -M)... C[a,b] («, b) and C[a,b] {s', a)c[a,b] {s', a + 1)... C[a,b] {s', b) are equal 
iS H[a,b]{s) = H[a,b]{s'). 

To answer a f orall(u, u, a, 6) query, where [a,b] is an elementary interval, we first map 
u and V into vertices u' and v' of and then report a positive answer iff = 

^[a,b\{'^')- III order to handle arbitrary intervals, we decompose the query interval into O(logt) 
elementary intervals. The decomposition as well as the mapping can be implemented as a 
function Forall-Aux(si, S 2 ,x, y, a, b), whose pseudocode is given in Appendix^ To answer a 
f orall(u, V, X, y) query we execute Forall-Aux(1[o [u], l[o,oo] [w],x, y, 1, t). 

Let us now describe the computation of fingerprints. They are computed in a bottom-up 
fashion, starting from the leaves of T. 

Lemma 10. Let P = [a,b] be an elementary interval and s G V{Sp). Define: 


Hp{s) 


(s, 0 ) iflp{s)=± orrp{s)=± 

{Hieft{P) (Ip [s]), Hrighti^p) (rp [s])) otherwise. 


Then cp{si,a)... cp{si,h) = cp{s 2 ,a)... cp{s 2 ,b) iff Hp{si) = Hp{s 2 ). 

Proof. If lp[si] =T, then cp{si,a) = {si,P) and for each S 2 G V{Sp) such that si / S 2 , we 
have cp{s 2 ,a) / {si,P). The pair Hp{si) = (si,0) is unique among the pairs Hp{s), so we 
have cp(si, a)... cp{si, b) = cp{s 2 , a)... cp{s 2 , b) iff si = S 2 - Analogously, if rp[si] =T, then 
cp{si,b) cp{s 2 ,b) for si S 2 and thus si is given a unique pair Hp{si) = (si,0). Therefore, 
if lp[si] =T or rp[si] =T, then Hp{si) = Hp{s 2 ) is equivalent to si = S 2 . 

It remains to consider the case, when si and S 2 are represented in both S'ieft(p) and 5'i.ight(p)- 
Let m = [(a -|- b)/2\. By the definition of the pair Hp{s) we have cp{si,a)... cp{si,m) = 
cieft(P)(lp['Si],a) • • • cieft(p)(lp[si],m) as well as cp{si,m + 1)... cp{si,b) = Ci.ight(p)(rp[sj], m -h 
1)... Ci.ight(p)(i'p['Si], b). The sequences cp{si, a)... cp{si, b) and cp{s 2 , a)... cp{s 2 , b), are equal 
exactly when their corresponding halves are equal, that is, by the definition of fingerprints H, 

iff I^ieft(P)(lp['Si]) = iLieft(P)(lp[s2]) and FI,ight(P)(rp[si]) = FI,ight(P) (rp[ 52 ]) • □ 


Observe that the pairs Hp{s) from the above lemma satisfy the desired properties of finger¬ 
prints, with the exception that they are pairs of integers, not integers. Thus, in order to compute 
the values Hp{s), it suffices to map the values of Hp{s) into distinct positive integers (two pairs 
are assigned the same integer iff they are equal). As both numbers in each pair P[p{s) are at 
most 0(|H(S'p)|) we may compute the mapping in linear time by using radix-sort algorithm. 
Note that this resembles the Karp-Miller-Rosenberg |B] algorithm. The total additional time 
and space used is 0(^p |H(5p)|) = O(tlogn). Thus, we obtain an {0{m + t log n),0 {log t)) 
data structure for answering forall queries. 

However, the query time can be made independent of the length of the timeline and speeded 
up to O(logn). In order to do that, we employ a shortcutting technique similar to the one 
used for finding connected components of vertices in individual graphs combined with a data 
structure for comparing the subwords of a given word. 

Assume again that D is the smallest integer such that 2^ > n. Observe that the shortcut 
table from Section [3] allows us to speed up forall(u, u, x, y), where [x,y] C [A;- 2 ^-|-l, (A:-|-l)- 2 ^] 
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for some k E [0,2^ ^). Let short cut [n] [A:] = {su,Pu) and short cut [u] [A:] = We only 

need to perform the following steps: 

1. If Pu ^ Pv, then the answer is false. 

2. If Pu = Pv, but the length of Pu is more than 2^, then the answer is true if and only if 

Su - Sy. 

3. Otherwise, = [A: • 2^ + 1, (A: + 1) • 2^] and the answer can be obtained by calling 

Forall-Aux(s.u, Sy, x,y,k-2^ + 1, (A: + 1) • 2^). 

Only the third steps takes superconstant time, namely 0(log2^) = O(logn). 

Consider the general query f orall(u, u, x, y), where [x,y] ^ [A: • 2-^ + 1, A: • (2^ + 1)], for 
any k. Let h be the smallest integer such that x < h ■ 2^ + 1 and I 2 be the largest integer 
such that Z 2 • 2^ < y. Then, our query can be split into the conjunction of three queries: 
f orall(tt, V, X, h ■ 2^), f orall(u, v, l2-2^ + 1, y) and f orall(u, v, li-2^ + 1 ,12 ■ 2^) (we assume 
the last query to be true if li = I 2 ). The first two can be answered in O(logn) time, as 
discussed above. We deal with the third one in a different way. Assume li < l 2 - For any I, let 
short cut [u][/] = {su,Pu) and short cut [u][^] = {sy,Py). Dehne hy := {Hp^{su),Pu)- Observe 
that the answer to f orall(u, v, 1-2^+1, (/+l)-2^) is affirmative exactly iff h}^ = h^, which follows 
immediately from the definition of fingerprints. Thus, to answer f orall(u, v, li -2^ + 1 ,12 ■ 2^), 
we need two check if the sequences and are equal. We use 

the following algorithm, described for instance in [3]. It uses the linear construction of a suffix 
array and the optimal range minimum query structure. 

Lemma 11 ([3]). There exists a data strueture that, after a linear preprocessing of the word 
W, allows us to cheek in time 0(1) if two subwords ofW are equal. 

Let Xy = hyhy ... hy^ and let X be the concatenation X 1 X 2 ■ ■ -Xy. Notice that the 
length of X is 0{t). We build the data structure of Lemma fTTl for the sequence X in 0{t) time. 
Hence, we can answer the query f orall(u, u,• 2^ + 1,^2 • 2^) by comparing the appropriate 
two subwords of X of length I 2 — h in time 0(1). 

Theorem 1. There exists an (0(m + A log n), 0(log n)) data structure for answering forall 
queries. 

4.1 2-edge-connectivity 

As in the case of the connectivity relation, for each s E H(S'[a b]), we want to encode the 
entire history of what happens with s in each of the individual versions Gy ,... ,Gfe. Since we 
introduced path vertices in the graphs ^[a,^] > the appropriate fingerprints need to be defined in 
a more subtle way. 

We partition the vertices of a graph S[a,b] into three groups: 

1. simple vertices, 

2. vanishing path vertices. If s is a vanishing path vertex and it represents a path of 2-edge- 
connected components ci,..., c^, then for each x E [a, b], all the components ci,..., 
are parts of a single, larger 2-edge-connected component Cx in Gx, 

3. non-vanishing path vertices, s E H(S'[a (,]) is a non-vanishing path vertex if there exists 
X E [a,b] such that the underlying 2-edge-connected components ci,... ,Cfc are all actual 
2-edge-connected components of Gx- 
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Let us assume that P = [a,b] and s E V{Sp). Denote by cp{s,x) the result of 
Comp-Id(s, a, 6, x). As in the case of connectivity we define Hp{s) E |D(5p)|} U {±} 

to be the fingerprint of the sequence cp{s, a)... cp{s,b). We set Hp{s) =_L only if s is a 
non-vanishing path vertex. Otherwise, if s is simple or a vanishing path vertex, Hp{s) is an 
integer. 

There is a reason why a non-vanishing path vertex is not assigned an integer fingerprint: if 
vertices v,w & V, v ^ w are both represented by a non-vanishing path vertex s in S'p, then the 
sequences cp_t](l[o,oo]M,a) • • ■ C[i,t](l[o,oo]H,fe) and (l[o,oo]M>«)••■ C[i,t](l[ 0 ,oo]N,fe) might 
be different. 

In order to define the fingerprints Hp{s) based on the fingerprints for the children intervals 
left(P) and right(P), we first define the initial fingerprints Hp{s). The values Hp{s) have 
the same properties as the fingerprints Pp(s) defined above, except that if Hp{s) t^T, then 
Hp{s) is a pair of integers from the range [1, |I/(Sp)|]. The initial fingerprints can be computed 
according to the following rules, which implicitly decide whether a path vertex s is vanishing or 
non-vanishing. 

1. If s is a simple vertex: 

• if lp[s] =T or rp[s] =T, then Hp{s) = (s,0). The fingerprint has to be unique in 
this case. 

• Let Si = lp[s] and Sr = rp[s]. If Pieft(p)['S«] =T or Pright(P)[sr] =T, then Hp{s) = 
(s,0). It is a consequence of s representing exactly a single 2-edge-connected com¬ 
ponent of Gx, for some x E [a,b]. 

• Otherwise, Hp{s) = (Pieft(p)(sz),-f^right(P)(sr))- 

2. If s is a path vertex: 

• if lp[s] =T or rp[s] =T, then Pp(s) =T, 

• Let Si = lp[s] and Sr = rp[s]. If Pieft(p)[sz] =T or Pright(P)['Sr] =T, then Hp{s) =T. 

• Otherwise, Hp{s) = (Pieft(p)(sz),-f^right(P)(sr))- 

Again, we can use radix-sort to convert each value Hp{s) (distinct from T) to an integer 
Hp{s) in the range [1, |D(S'p)|]. 

We now sketch how to answer the query f orall2(tt, v, x, y) in time O(logt). As in Section^ 
we reduce this problem to answering O(logt) queries with the time period being an elementary 
interval. Let us focus on a single elementary interval P. For w E {n, u}, denote by the last 
interval on the path [l,t] —>■ P such that vertex w is represented in Sq^ by a vertex s^}. Also, 
we denote by the last interval on path [l,t] P such that vertex w is represented in Sqi^ 
by a simple vertex s'^. Denote the quadruple {Qw, Sw,Qwi ^’w) by 

We have the following cases: 

1- Qu 7^ Qv The answer is clearly false. 

2. Qu = Qv, Qu / P- As Su or Su could potentially be path vertices, the answer is positive 
if and only if {Q'^, s'J = (Q(,, s(,). 

3. Qu = Qv = P- 

(a) Hp{su) /T and Hp{sy) /T. The answer is true iff Hp{su) = Hp{su). 

(b) Hp{su) =T or Hp{sy) =T. The answer is the same as the result of the comparison 
{Q'u,s'u) = {Q'v,s'v)- 
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Once we have the fingerprints, all the above checks can be performed in 0(1) time, therefore 
we can answer forall2 queries in O(logt) time. 

In order to optimize the query time to O(logn), we adapt the technique used to speed 
up forall queries for connectivity. Assume again that D is the smallest integer such that 
2^ > n. First we show that we can answer the query forall2(u, u, x, y), where [x,y] C 
[/c-2^ + 1 , {k+l)-2^] for some k G [0, in time O(logn). We precompute the values 4>p{w) 

for each w ^ V and an elementary interval P not longer than 2^. As the number of elementary 
intervals not longer than 2^ is 0{t/n), we precompute 0{t) values in total. (I>p{w) can be 
computed based on </>par(P)(^^) ™ constant time, so we spend 0{t) time on precomputation. 

Let [x, y] C [k-2^ + l,{k + l)- 2^] and let Pi ,..., Pp be the partition of [x, y] into elementary 
intervals. As each Pi is a descendant of [k ■ 2^ 1, (A: + 1) • 2^] in the segment tree, the value 

4’[k-2^+i,{k+i)-20]i'‘^) can be used as a starting point to compute the values 4'p^{w). We need to 
descend only B — D levels down the tree to compute the values (pp^{w). Thus, the time needed 
to answer forall2(u, u, x, y) is 0{B — D) = O(logn) in this case. 

To handle the general query forall2(tt, u, x, y), let li be the smallest integer such that 
X < /i • 2^ + 1 and I 2 be the largest integer for which I 2 ■ 2^ < y holds. Our query can be 
split into the conjunction of three queries: f orall2(u, u, x, • 2^), f orall2(u, u, ^2 • 2^ + l,y) 
and f orall2(u, v, li ■ 2^ + 1 ,12 ■ 2^) (we assume the last query to be true if li = 12 ). The first 
two can be answered in O(logn) time, as discussed above. In order to answer the last query, 
we need to rephrase the check f orall2(tt, u, / • 2^ + 1, (/ + 1) • 2^) in terms of symbol equality 
= h}^. Indeed, for P; = [P 2'^ + 1, (/ + !)• 2^] and <^P;(rc) = {Qu,jSyj,Q'^,s'^) we can set: 


h 


i 

W 


'{Qn.,QL,s'J ifg^/P; 

^ {Qwj if Qw — Pi and Hp^i^Sw') 

if Pi nnd Pp^(s^(,) T . 


It can be easily verified that = h}^ if and only if the previously described checks for answering 
f orall2(n, v, x, y) where [x, y] = [P 2^ + 1, (/ + !)• 2^], produce a positive answer. 

We answer the query f orall2(n, u, • 2^ + 1 ,12 ■ 2^) by checking if the words ... /i^ 
and are equal. The needed words are all subwords of the word Xi... X^, where 

Xy = h^hl... The word length is 0{t). By Lemma [TTl after additional preprocessing 

in 0{t) time, we can answer the query f orall2(u, u,• 2'^ + 1,^2 • 2^) in constant time. 

Theorem 2. There exists an {0{m + t log n),0(log n)) data structure for answering forall2 
queries. 


5 Improved lower and upper bounds for exists queries 

In this section we focus on exists queries. We first give improved conditional lower bounds for 
answering these queries, and then show an algorithm, whose running time matches one of the 
new bounds. As shown in [9j, the problem of multiplying two Boolean n x re matrices can be 
reduced to the problem of answering Q{n?) exists queries about a graph timeline G*, where 
t = 0(re^). Denote by 0{n‘^ ) the time required to perform re x re Boolean matrix multiplication 
(BMM). Thus, unless iw' = 2, it is not possible to develop a data structure, which after almost 
linear preprocessing answers exists queries in polylogarithmic time. In this section we give 
several new lower bounds. 

Throughout this section, we repeatedly use e to denote an arbitrarily small, positive number. 
The exact value of e may vary and depend on the context. We also denote by 6(e) some other 
small positive number, dependent on e. 
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Let us recall the somewhat informal, yet important, partition of algorithms into algebraic 
and combinatorial. The combinatorial algorithms do not make use of the fact that the matrices 
are defined over a ring, i.e., they do not use subtraction. No 0(n^“^) combinatorial algorithm 
is known for BMM. 

We show a connection between the exists data structure and algorithmic problems re¬ 
lated to detecting triangles in graphs. In the triangle detection problem we are given a graph 
G = {V,E), where |ill| = m, and the goal is to find three vertices a,b,c € V such that 
{a,b),(a,c),(j3,c) E E. The best known known algorithm for triangle detection was given 
by Alon et al. [T] and works in time. The best combinatorial algorithm is folklore and 

runs in 0{my/rn) time. The following relation between triangle detection and BMM was shown 
in [TT] : 

Lemma 12. An combinatorial algorithm for triangle detection implies an 

combinatorial algorithm for BMM. 

The related problem is triangle listing, where we are asked to find c triangles in a graph 
with m edges. Patra§cu [TOj proved the following lemma. 

Lemma 13. If one can list m triangles from a graph with m edges in time, then 

there exists an algorithm for 3-SUM. 

We now show a relation between triangle listing and exists queries. 

Lemma 14. The problem of listing c triangles in a graph with m edges can be reduced to 
answering O^m + clogn) exists queries in a timeline G* of length t = 0{m) and no permanent 
edges. 

Proof. Let H be the input graph, in which we are supposed to list triangles. Moreover, let 
V{H) = {ui,...,Un}. We build a timeline G* of graphs on vertex set V{H) by processing 
vertices vi,.. . ,Vn one by one. First, we add an empty graph to G*. Then, for a vertex Vi, we 
append 2degff(vi) new versions to G* (degff(v) denotes the degree of vertex v in H), which we 
call a block of vertex Vi. Within each block, we first create degj;^(ui) new versions, at each step 
adding one more edge incident to Vi. The edges are added in arbitrary order. Then, we create 
degj:^(uj) more versions by removing the edges incident to Vi. Note that the last graph in every 
block is empty, and in the middle graph the vertex Vi has degree deg Let the the block 
of a vertex Vi start at Ga^ and end at G^^. 

Observe that we obtain a timeline G*, where t = dm -|- 1, as each edge of H is added and 
removed exactly twice. For each edge {vi,Vj) E E{G), i < j, we can test if there is a triangle 
{vi,Vj,Vk), where j < k, with a single query exists(uj, Vj, bj -\-l,t). Indeed, the answer to such 
a query is positive iff there exists Vk such that there is a path from Vi to Vj in Gaf.+degH{vk)-^- 
The path, along with the edge {vi,Vj), forms a triangle. 

Note that the query ex±sts{vi,Vj,ap,bq), for j < p < q, tells us if there is any triangle 
{vi,Vj,Vk) such that k E \p,q]. Thus, we may use a divide-and-conquer approach for listing 
triangles, which is based on the following observation. If we are looking for triangles such that 
k & \p,q], 8i negative answer to an ex±sts{vi,Vj,ap, 6 (p+q)/ 2 ) query allows us to halve the search 
interval. Hence, we can find all I vertices Vk such that {vi,Vj,Vk) is a triangle in time 0{l logn). 
The detailed procedure Report-Triangles is given in Appendix lAl □ 

By combining Lemmas fT^ [13] and [T41 we obtain the following. 

Theorem 3. Let dt be a problem of answering 0(f) exists queries about an arbitrary graph 
timeline G* with no permanent edges. 
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• An algorithm for 'I' implies an algorithm for triangle finding. 

• An 0{t^'^~'") combinatorial algorithm for implies an combinatorial algorithm 

for BMM. 

• An algorithm for 'I' implies an algorithm for 3-SUM. 

In addition, we show that an exists data structure with preprocessing/query time product 
of 0{t‘^~^) and queries substantially faster than 0(\/t) implies a faster BMM algorithm. 

Lemma 15. Suppose there exists an {0{t^~‘^~'^),0{t^)) combinatorial data structure for an¬ 
swering exists queries, where q G [0, |) is a parameter. Then there exists an 
eomhinatorial algorithm for BMM. 

Proof. We use the assumed algorithm for answering exists queries to develop an 
combinatorial algorithm for BMM. By Lemma fT^ in order to obtain such an algorithm, it 
suffices to show an algorithm for triangle detection. This in turn, by Lemma [TTl 

can be reduced to answering 0{m) exists queries in a graph timeline G* with no permanent 
edges, where t = 0{m). To complete the proof, we show how to answer these queries in 
total time. 

Split G* into blocks of length consisting of consecutive versions, where a € [0,1] is to be set 
later. Let Gq, ..., G;, be one of the blocks. First, we compute in time 0(t) the edges alive during 
entire block and contract the components formed by those edges. Next, we mark the components 
subject to any updates in the interval [a,b]. There are 0(t“) marked components. All the 
queries exists(n, u, c, d) such that [c,d] C [a,b] and concerning the unmarked components, can 
be answered in constant time — if suffices to check whether components of u and v are the same. 
For the remaining 0{t°‘) components we build the assumed data structures. The initialization 
takes time + t) per individual block, which gives time for 

blocks. 

We can answer an exists(u, u, x, y) query by going through at most blocks inter¬ 
secting [x, y\ and querying each block structure once. Thus, the query overhead is = 

By setting a = 2 - 2 q-€ '> have a G ( 5 , 1 ]- The exponent of the initialization time becomes 
less than |, as we have: 


1 -I- a(l - q - e) 


1 + 


1 — q — e 

2 — 2q — e 


< 1 + 


1 — q — e 
2-2q-2e 


3 

2 ’ 


2 


-a<2-i 


3 

2 ’ 


whereas the exponent of time needed to answer 0{t) queries is 


1 + 1 


a(l — q) = 2 — 


1-g 

2 — 2g — e 


< 2 - 


1- g 

2- 2q 


3 

2 ' 


The lemma follows. 


□ 


What is interesting, we can give a combinatorial data structure, whose running time matches 
the above lower bound. 

Theorem 4. For every 0 < a < 1 there exists an (0(m + min(nt, 0(t“)) data strueture 

for answering exists queries. It uses 0(min(nt, space. 
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Proof. If re = then nt = and the {0{m + nt), 0(1)) data structure for answering 

exists queries from [9] is sufficient to finish the proof. 

Let us assume that re = We first build the tree-like structure T in 0{m + flog re) 

time. Let D be the largest integer such that 2^ < t". We split the graph timeline into blocks of 
size 2^“^, which is roughly Denote the versions of the i-th block by Gai ,■■■■, GfJ^. Observe 

that [ai,bi] is an elementary interval. 

Denote by li{v) the pair {s,P), where P is the last interval on a path from the root [l,t] to 
[ai,bi] such that u G D is represented by s in Sp. 

Consider answering the block query exists{u,v, x,y), where [x,y] C [ai,bi]. Set li{v) = 
{sv,Pv) and li{w) = {sw,Pw)- If Pv / Pw, then the answer is clearly false. Otherwise, if 
Pv 7^ [o-ijbi], then neither v nor w are represented in and thus the answer is the same 

as the result of a comparison = Sw In the last case, when P^ = P^ = [ai,6j], the result 
can vary. However, we can use the {0{m -|- ret), 0(1)) data structure for answering exists 
queries from |2]. For each i, we build this structure for a graph with vertices V{S[a.i,bi]) and the 
timeline induced by the subsequent edge updates in graphs 0^^,... . The size of H(5[a.^ft.]) 

as well as the length of this timeline is 0(5j — Oj) = 0(t^“"), so it can be initialized in time 
0((t^““)^) to answer queries in 0(1) time. In a single query, it gives us the desired answer to 
exists{sy,Sw,x,y). 

As a result, once we have found li{v) and li{w), we can answer the block query in constant 
time. Observe that a general exists query can be answered by issuing 0(t") block queries. All 
the required 0(t“) lj{u) values can be computed by traversing the first D levels of T, which can 
be implemented to work in time We need to build exists data structures — each 

in time 0((t^““)^), so the total initialization time is 0(rre -|- -|- flog re) = 0(rre -|- □ 

6 Open problems 

For forall and forall2 queries, we gave an (0(rre-|-t logre), O(logre)) data structure. What 
about the biconnectivity? Although it is possible to propose a similar tree-like structure that 
represents biconnectivity in individual versions, it seems hard to extend it to forall2-like 
queries. The main obstacle is biconnectivity relation on vertices not being an equivalence 
relation. 

It would be also interesting to know whether even faster query (without sacrificing 0{t log re) 
initialization time) is possible for forall queries. 

Concerning exists queries, we proved that beating our trade-off structure in the domain 
of combinatorial algorithms implies a faster combinatorial matrix multiplication algorithm. 
However, is there a way to employ fast matrix multiplication to obtain a data structure for 
exists queries with preprocessing/query time product of 0(f^“^)? 
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A Omitted pseudocode 


1: function COMP-lD(ri), a, 6, c) > w £ V{S[a,i,]),c £ [a,b] 

2: if a = 6 then 

3: return (rc, [a, 6]) 

4: mid := [^J 

5: if c < mid then 

6: if l[a^6][rc] =T then 

7: return (rc, [a, 6]) 

8: else 

9: return COMP-lD(l[a[re],a,mid,c) 

10: else 

11: if [rc] =T then 

12: return (w, [a, 6]) 

13: else 

14: return COMP-lD(r[„[re], mid J-1, 6, c) 
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function Forall-Aux(si, S 2 ,x, y, a, b) 
if Si = S 2 then 
return true 

if [x, y] = [a, b] then i> [x, y] is elementary, we refer to fingerprints 

if Hia,b]{si) = Hia,b]{s 2 ) then 
return true 
else 

return false 

mid := 

if X < mid then 

if Ma,b][^i] =-L or l[a,b][s 2 ] =± then 
return false 

if not FORALL-Aux(l[a^fe][si], [s 2 ], aJ, mid, o, mid) then 
return false 
if y > mid then 

if r[a,6]['Si] =-L or r[a^b][s 2 ] =± then 
return false 

if not FORALL-Aux(r[Q ;,][si], r[a [s 2 ], mid + 1, ?/, mid + 1, 6) then 
return false 
return true 


procedure Report-Triangles(G) 

construct the timeline G* from the proof of Lemma O along with numbers a*, bi 
for {u,v) € E{G) do > u <v 

report-internal(u, X, X + 1, |F(G)|) 

procedure REPORT-lNTERNAL(tt, v, x, y) 

if X > y then 
return 

if not exists(u,v,ax,by) then 
return 
if X = y then 

output triangle {u, v, x) 
if k triangles have been reported so far, stop 
return 
mid := 

Report-Internal(u, V, x, mid) 

REPORT-lNTERNAL(ri, V, mid + 1, y) 
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